A hybrid approach to 3d tongue modeling from vocal tract MRI using unsupervised image segmentation and mesh deformation
نویسندگان
چکیده
Vocal tract magnetic resonance imaging (MRI) has become one of the preferred imaging modalities for the analysis of human speech production. However, the raw image data must be segmented before further analysis can take place. This paper describes a hybrid approach to extract a 3D tongue model from 3D or 2D MRI scans of the vocal tract during speech, which combines unsupervised image segmentation with a mesh deformation technique. An efficient, minimally supervised segmentation algorithm can also be used as an alternative to provide a robust fallback in certain isolated cases. Both image segmentation algorithms produce a point cloud, which is completed and registered by deforming a template mesh to the data. Since the mesh deformation can be applied even with a sparse point cloud, it is possible to extract realistic 3D tongue shapes even from the 2D video frames of real-time MRI. Our approach is applied to several sets of available MRI data and yields promising results.
منابع مشابه
Tongue Mesh Extraction from 3D MRI Data of the Human Vocal Tract
In speech science, analyzing the shape of the tongue during human speech production is of great importance. In this field, magnetic resonance imaging (MRI) is currently regarded as the preferred modality for acquiring dense 3D information about the human vocal tract. However, the desired shape information is not directly available from the acquired MRI data. In this chapter, we present a minima...
متن کاملExtraction and 3D Segmentation of Tumors-Based Unsupervised Clustering Techniques in Medical Images
Introduction The diagnosis and separation of cancerous tumors in medical images require accuracy, experience, and time, and it has always posed itself as a major challenge to the radiologists and physicians. Materials and Methods We Received 290 medical images composed of 120 mammographic images, LJPEG format, scanned in gray-scale with 50 microns size, 110 MRI images including of T1-Wighted, T...
متن کامل3D segmentation of the tongue in MRI: a minimally interactive model-based approach
Static magnetic resonance imaging partially resolves soft tissue details of the oropharynx, which are crucial in swallowing and speech studies. However, delineation of tongue tissue remains a challenge due to the lack of definitive boundary features. In this article, we propose a minimally interactive inter-subject mesh-to-image registration scheme to tackle 3D segmentation of the human tongue ...
متن کاملPolygonal Mesh Comparison Applied to the Study of European Portuguese Sounds
The purpose of the authors’ study was to evaluate the feasibility of using a mesh comparison tool in the study of European Portuguese speech sounds. A large 3D MRI database from several speakers, including various sounds and contexts has been acquired. Segmentation, visualization, and analysis of such a large database are complex, time-consuming tasks, preventing the use of manual segmentation ...
متن کاملSegmentation and 3D reconstruction of the vocal tract from MR images – a comparative study
Speech production is an important human function involving a set of organs with specific morphological and dynamic aspects. The inter-speaker variability, the coarticulation or the nasality are some interesting aspects to improve a realistic 3D modeling of the vocal tract. For this, the understanding of the mechanism of speech production is crucial, as the current image data is not sufficient t...
متن کامل